Aside

Contact

Publications

Computational Skills

R
Bash
Python
CWL workflow
Docker
AWS

Disclaimer

Made with the R package pagedown.

The source code is available on github.com/seifudd/cv.

Last updated on 2025-02-18.

Main

Fayaz Seifuddin

Background

Experienced bioinformatics leader with 15+ years in computational genomics, data science, and machine learning, driving innovation in multi-omics research

N/A

N/A

N/A

Proven track record in developing and overseeing bioinformatics strategies, optimizing data processing pipelines for genomics, transcriptomics, and epigenomics

N/A

N/A

N/A

Expertise in managing large-scale NGS datasets, ensuring data integrity, reproducibility, and actionable insights for translational and clinical applications

N/A

N/A

N/A

Strong background in leading cross-functional teams, collaborating with researchers, clinicians, and data scientists to advance bioinformatics initiatives

N/A

N/A

N/A

Published scientist with a history of driving impactful research, translating complex genomic data into meaningful discoveries

N/A

N/A

N/A

Employment

Senior Scientist in Computational Genomics

Bristol Myers Squibb

Lawrenceville, NJ

Current - 2021

Bioinformatics Staff Scientist

National Institutes of Health (NIH), National Heart, Lung, and Blood Institute (NHLBI)

Bethesda, MD

2021 - 2019

Senior Bioinformatics Research Data Analyst

Mood Disorders Center, Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, School of Medicine

Baltimore, MD

2016 - 2012

Bioinformatics Research Data Analyst

Mood Disorders Center, Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, School of Medicine

Baltimore, MD

2012 - 2008

Bioinformatics Intern

J. Craig Venter Institute (JCVI)

Rockville, MD

2008 - 2007

Education

PhD., Bioinformatics and Computational Biology

George Mason University

Fairfax, VA

2019 - 2016

  • Dissertation: lncRNAKB- A Knowledgebase of Tissue-specific Functional Annotation and Trait Association of Long Noncoding RNA

M.S., Bioinformatics and Computational Biology

George Mason University

Fairfax, VA

2008 - 2006

  • Thesis: Identification of Lung Adenocarcinoma Progression Biomarkers

B.S., Computer Science (minor-Mathematics)

Linfield University

McMinnville, OR

2005 - 2001

Industry & Academic Experience

Senior Scientist in Computational Genomics

Bristol Myers Squibb

Lawrenceville, NJ

Current - 2021

  • Leading NGS data analysis for clinical trials, exploratory, and research studies, deriving actionable insights for precision medicine
  • Developing and optimizing bioinformatics pipelines, integrating AWS, Docker, CWL, in Seven Bridges and Arvados platforms for scalable, reproducible workflows
  • Directing innovative research projects on Microsatellite Instability (MSI), alternative splicing analysis, neoantigen prediction, Tumor Mutational Burden (TMB), genetic ancestry analysis, cfDNA nucleosome profiling and RNA-Seq lite in clinical applications
  • Designing novel tools and workflows to support long-read sequencing (PacBio) data and advanced transcriptomics analyses

Bioinformatics Staff Scientist & PhD. Research Fellow (in Bioinformatics and Computational Biology)

National Institutes of Health (NIH), National Heart, Lung, and Blood Institute (NHLBI)

Bethesda, MD

2021 - 2016

  • RNA-Seq (Transcriptomics): Led large-scale transcriptomic analyses (GTEx, 11,000+ samples), developing HPCC pipelines for expression quantification, differential analysis, pathway enrichment, eQTL and sQTL mapping
  • DNA-Seq (Genomics): Designed and implemented HPCC workflows for WGS, WES, targeted DNA-seq, and CRISPR studies, including germline/somatic mutation calling, CNV detection, and mutational burden analysis
  • Methyl-Seq (Epigenomics): Built WGBS pipelines for genomic and cfDNA analysis, performing CpG methylation analysis, deconvolution for tissue-of-origin identification, and processing ChIP-Seq, CUT&Tag, and CUT&RUN data
  • Proteomics: Applied machine learning models (random forest and mixed-effects modeling) to predict cardiovascular risk using high-density lipoprotein proteome data
  • lncRNA Analysis: Developed lncRNAKB (https://osf.io/ru4d2/), a knowledgebase integrating lncRNA tissue-specificity and disease associations, incorporating cis-eQTL, GWAS associations (UK Biobank), and co-expression modules from GTEx RNA-Seq data

Senior Bioinformatics Research Data Analyst

Mood Disorders Center, Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, School of Medicine

Baltimore, MD

2016 - 2012

  • Led a systematic review and mega-analysis of microarray gene expression studies in brain tissue for mood disorders
  • Developed Metamoodics, a web-based bioinformatics platform integrating genome-wide linkage, expression, and association data for Major Depression and Bipolar Disorder
  • Conducted polygenic risk score (PRS) analysis, linking genetic predisposition to stressful life events and depressive symptoms in older adults
  • Designed and implemented HPCC pipelines for epigenetic data analysis, including Whole Genome Bisulfite Sequencing (WGBS), to study mood disorder associations

Bioinformatics Research Data Analyst

Mood Disorders Center, Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, School of Medicine

Baltimore, MD

2012 - 2008

  • Designed and executed scalable bioinformatics pipelines on High-Performance Computing Clusters (HPCCs) for GWAS analysis of mood disorders, integrating Illumina & Affymetrix array data
  • Led imputation and statistical analysis of GWAS data, enhancing the resolution of genetic association studies
  • Conducted a comprehensive meta-analysis of candidate gene studies in Bipolar Disorder, identifying statistically significant polymorphisms
  • Applied machine learning and data mining techniques to analyze high-dimensional genomic data from psychiatric disorders
  • Developed analytical workflows for genome-wide DNA methylation (DNAm) analysis

Bioinformatics Intern

J. Craig Venter Institute (JCVI)

Rockville, MD

2008 - 2007

  • Developed SMURF (http://smurf.jcvi.org/index.php), a web-based tool for genomic mapping of fungal secondary metabolite clusters, enabling systematic prediction of biosynthetic gene clusters
  • Designed algorithms to analyze genomic sequences and identify gene clusters responsible for biosynthesis, export, and transcriptional regulation of secondary metabolites
  • Enhanced data-driven insights into fungal metabolomics, providing researchers with a streamlined approach to studying secondary metabolism

Teaching Experience

Trainer, Bioinformatics Analysis for NGS Data

The NHLBI Workshop Series

Bethesda, MD

2019

  • Trained 20 staff scientists on NGS platforms and data analysis techniques
  • Delivered sessions on data quality assessment (FastQC), preprocessing (Trimmomatic, Cutadapt), and alignment (BWA, Bowtie2)
  • Taught variant calling (GATK, samtools) and annotation (Annovar, VEP)
  • Guided gene expression analysis (Salmon, Kallisto, DESeq2, edgeR)
  • Introduced data visualization tools (IGV, R/Bioconductor)
  • Covered functional enrichment and pathway analysis (DAVID, GOseq)
  • Emphasized best practices, troubleshooting, and validation in NGS analysis

Guest Lecturer

George Mason University, Department of Bioinformatics and Computational Biology

Fairfax, VA

2019 - 2017

  • Lectured M.S. and Ph.D. students on current trends, methods, and analysis of biological sequence and NGS functional genomics data
  • Engaged and instructed a class of 15 students

Tutor Coordinator & Mentor

Dallas College North Lake Campus, TRiO Upward Bound

Dallas, TX

2006 - 2005

  • Hired, trained, and mentored a team of 10 tutors for the Upward Bound program, supporting first-generation and low-income high school students in college preparation

Teaching & Lab Assistant

Linfield University, Department of Computer Science

McMinnville, OR

2005 - 2004

  • Designed, assisted, and graded C++ and Java programming assignments for 20 first- and second-year students in Programming and Data Structures courses

Tutor

Linfield University, TRiO Upward Bound

McMinnville, OR

2005 - 2001

  • Tutored 60 students in Math and Science, focusing on Algebra I & II, Geometry, Pre-Calculus, and Calculus

Honors and Awards

NIH Pre-Doctoral Intramural Research Training Award (IRTA)

NHLBI

Bethesda, MD

2019 - 2016

Graduate Student Research Award in Bioinformatics / Biostatistics / Epidemiology / Computational & Systems Biology

NIH, Graduate Student Research Symposium

Bethesda, MD

2019

Outstanding Digital Presentation Award in Bioinformatics / Computational & Systems Biology

George Mason University, Student Research Day

Fairfax, VA

2019

Reviewers Choice Poster Abstracts, Bioinformatics & Computational Biology

The American Society of Human Genetics (ASHG)

Houston, TX

2019

Sumie Yamamoto Endowed International Students Scholarship

Linfield University

McMinnville, OR

2005 - 2001